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ABSTRACT 

The analysis of the shear induced by a single cluster, acting as a gravitational 
lens, on the images of a large number of background galaxies is all centered around 
the curl-free character of a well-known vector field that can be derived from the data; 
in particular, the mass reconstruction methods, currently producing many interesting 
q ; astrophysical applications, are all based on this tensorial property of the induced 

shear. Such basic property breaks down when the source galaxies happen to be 
observed through two clusters at different redshifts, partially aligned along the line 
of sight, because an asymmetric part in the Jacobian matrix associated with the ray 
tracing transformation is expected, now that the incoming light rays are bent twice. 
In this paper we address the study of double lenses and obtain five main results, (i) 
First we generalize the procedure to extract the available information, contained in 
the observed shear field, from the case of a single lens to that of a double lens, (ii) 



Then we evaluate the possibility of detecting the signature of double lensing given 
the known properties of the distribution of clusters of galaxies. In particular, we 
show that a few configurations are likely to be present in the sky, for which the small 
effects characteristic of double lensing may already be within detection limits (i.e., 
we show that the signal involved is expected to be larger than what could be produced 
by statistical noise, which includes effects associated with the distribution of the 
source ellipticities and with the spread in redshift of the lensed galaxies), (iii) As a 
different astrophysical application, we demonstrate how the method can be used to 
detect the presence of a dark cluster that might happen to be partially aligned with 
a bright cluster studied in terms of statistical lensing; if the properties of the bright 
cluster are well constrained by independent diagnostics, the location and the structure 
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of the dark cluster can be reconstructed, (iv) In addition, we show that the redshift 
distribution of the source galaxies, which in principle might also contribute to break 
the curl-free character of the shear field, actually produces systematic effects typically 
two orders of magnitude smaller than the double lensing effects we are focusing 
on. (v) Remarkably, a discussion of relevant contributions to the noise of the shear 
measurement has brought up an intrinsic limitation of weak lensing analyses, since 
one specific contribution, associated with the presence of a non- vanishing two-galaxy 
correlation function, turns out not to decrease with the density of source galaxies 
(and thus with the depth of the observations). In our mathematical framework, the 
contribution of the small asymmetry in the Jacobian matrix induced by double lensing 
is retained consistently up to two orders in the weak lensing asymptotic expansion. 
The analysis is then checked and exemplified by means of simulations. 

Subject headings: cosmology: gravitational lensing — cosmology: dark matter — 
galaxies: clustering — galaxies: distances and redshifts 



1. Introduction 

The study of statistical gravitational lensing effects on large numbers of source galaxies due 
to intervening matter has been a subject of major and rapid progress both from the methodological 
and from the observational point of view (e.g., see Kaiser & Squires 1993; Luppino & Kaiser 
1997; Taylor et al. 1998; Lombardi & Bertin 1998a, b, hereafter Paper I, II). This indeed offers a 
very promising approach to study the overall mass distribution, especially on the scale of clusters 
of galaxies. In fact, the most appealing aspect of this line of research is its diagnostic potential, 
with the possibility to measure mass distributions independently of the traditional tools that have 
to rely on the use of dynamical models. 

The interest in this method has gained further momentum from the most recent developments 
of telescopes and instrumentation. Very deep images are now bringing in accurate data that allow 
us to analyze the shape of thousands of galaxies, with a density of nearly 200 galaxies arcmin~ 2 
already reached (see Hoekstra et al. 2000). The use of information from so many sources makes it 
possible to measure very small lensing effects, which would have been beyond expectation just a 
few years ago. 

The work in this area is thus mostly addressing two directions. On the methodological side, 
the main hope is to develop mass reconstruction algorithms that are simple, flexible, and reliable, 
with full control of the errors involved in the procedure that goes from the shear data to the 
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inferred properties of the lens (see, e.g., Bartelmann et al. 1996; Seitz & Schneider 1997; Paper I; 
Paper II). On the side of data analysis, a variety of important issues have to be properly faced 
and addressed, in order to secure accurate shape measurements as free as possible from all the 
undesired or spurious distortions that are associated with the instrument and with the observation 
conditions (Kaiser, Squires, & Broadhurst 1995). 

Within this general pespective, the present paper belongs to a third direction of research, 
i.e. the study of potentially interesting effects and phenomena, such as the lensing associated 
with gravity waves (Kaiser & Jaffe 1997) or cosmological applications where gravitational 
lensing probes the geometry of the universe (Kaiser 1998; Lombardi & Bertin 1999, Paper III; 
van Waerbeke, Bernardeau, & Mellier 1999). Measuring these effects, admittedly at the limits 
of current observations, is expected to become feasible with the advent of the next generation 
telescopes. 

The theory of double lenses has already been subject to some progress. For example, it 
has been proven that multiple lenses produce an odd number of images from a pointlike source 
(Seitz & Schneider 1992) and that a simple theorem on the magnification of images holds (Seitz 
& Schneider 1994), while Crawford et al. (1986) have discussed the probability that a quasar is 
strongly lensed by two clusters. However, this still remains a challenging line of research that is so 
far largely unexplored; in particular, the problem has not been discussed, to our knowledge, in the 
important context of weak lensing analyses. Attracted by the new developments in deep imaging 
and especially in the observation of very distant clusters of galaxies, which already suggest that a 
chance alignment of two clusters is not to be considered an unlikely event (for example, see the 
case of C10317 + 15 noted by Molinari, Buzzoni, & Chincarini 1996, and of A1758 reported by 
Wang & Ulmer 1997), we found it natural to give further thought to such lens configurations. 

In this paper, for the context of weak lensing, we generalize the procedure to extract the 
astrophysical information contained in the observed shear field from the case of a single lens to 
that of a double lens. We start by showing how the trace and the asymmetric part of the Jacobian 
matrix associated with the ray-tracing transformation can be measured (Sect. 2 and Sect. 3), once 
the observed ellipticities of a large number of background galaxies have been properly secured. 
We then consider some explicit questions that can be tackled in configurations of astrophysical 
interest; in particular, we formulate and address a "dark cluster problem," corresponding to the 
situation where the signature of double lensing is observed at variance with the apparent absence 
of a second bright cluster partially aligned along the line of sight (Sect. 4). We then estimate the 
size of the effects involved (Sect. 5), so that we can state that, based on the known properties of the 
distribution of bright clusters, a few configurations are likely to be present in the sky, for which the 
small effects characteristic of double lensing may already be within detection limits. Surprisingly, 
a discussion of the relevant contributions to the noise of the shear measurement reveals an intrinsic 
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limitation of weak lensing analyses. In fact, we find that the noise contribution associated with the 
two-galaxy cosmological correlation function does not decrease with the depth of the observation. 
A quantitative evaluation of this subtle effect and of its impact on mass reconstructions deserves 
a separate investigation and will be considered in a future paper. Some numerical tests in Sect. 6 
demonstrate that the analytical framework developed is sound and bring out the existence of some 
curious effects (especially on the criticality condition for a double lens and on the possibility of 
pinpointing the location of the invisible cluster in the dark cluster problem). Then, simulations in 
the same section show that the redshift distribution of the source galaxies, which in principle might 
also contribute to break the curl-free character of the shear field, actually produces systematic 
effects typically two orders of magnitude smaller than the double lensing effects we are focusing 
on. In addition, they suggest that reasonable assumptions on the observed galaxies lead to noise 
levels that do not mar the possibility of detecting the double-lensing effect. Even if, at present, 
much of what we have obtained is barely within the limits of direct experimental confirmation, 
with the advent of new instrumentation (such as ACS on HST) and powerful telescopes of the 
next generation (such as NGST) the observational impact of analyses of the type provided here is 
bound to become significant. 



2. Basic relations 

In this section we briefly recall the lensing equations for double lenses, in the context where 
the lensing is produced by intervening clusters. For the purpose, we mostly follow Schneider, 
Ehlers, & Falco (1992), with some small differences of notation (see also Paper III). We then 
summarize the principles of the statistical analysis that leads to a local measurement of the 
gravitational shear. 



2.1. Ray-tracing equation 

Let us consider for the moment a single source at redshift z s . Let D(zi, z%) be the angular 
diameter distance between two aligned objects at redshift z\ and z®, and Dij = D(zS % \ z^') (with 
i,j E {o, 1, 2, s} and = z^°' < z^ 1 ' < z^ < z^) the distances between two of the elements 
of the lens configuration, made of observer (o), two deflector planes (1 and 2), and source plane 
(s). We call 0^ (or simply 6) the apparent position of the source, so that a;* 1 ) = D o i0^' is the 
corresponding linear position on the first deflector plane. The light ray is traced back to angular 
positions 0^ 2 ' = x^ / D o2 and 6^ = x^ / D os referred to the second deflector and to the source 
(see Fig. [j]). For simplicity, in the following we will use the notation s for 0® . The "dynamics" 
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of the two lenses is contained in two deflection functions ct^ and a.( 2 \ such that 

x ^ = D 02 e^-D l2 ^\D 0l e^) , (i) 

x^ = D os 0« - D„aW {D ol 6^) - /W 2) {D o2 0^) . (2) 
Thus the ray-tracing equation can be written as 

s = - (0) - /3 (2) (0 - A/3 (1) (0)) , (3) 

with (£=1,2) 

= 2± a ®{D*0) (4) 

-^os 

and 

U\ s U o2 

For given projected mass distributions of the two lenses S^, the distances enter the 
problem through A and through the two critical densities, defined as 

{00 for z < z^ , 

c 2 Dos . (6) 

— — otherwise . 

The definition of Sc for z < just states that foreground sources are unaffected by the 
lens. The functions /3 W (0) can be expressed by suitable integrals of the two reduced densities 

cases of interest, A is smaller than unity. Note that, much like in the case 
of a single lens, since distances enter only through the ratios D ls / D os and D 2s / D os , for a family 
of very distant sources the ray-tracing equation (Q) can be applied without explicit reference to the 
distance of each source, and A remains finite. 

If the sources are located at different redshifts but not all of them are at large distance, we 
can still retain a form similar to Eq. @ for the ray-tracing equation provided we introduce the 
appropriate cosmological weight functions: 

w V)f z ) = . (7) 



The functions w^'(z) give the strength of the lens £ on a source at redshift z relative to a source at 
redshift z s . [Note that in Paper III we used a similar definition of cosmological weight function, 
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but took the "reference" redshift 2 s at infinity. Here we prefer to avoid the limit z s — > oo.] With 
this definition, Eq. (Q) can be rewritten as 

s = 0- u> (1) /3 (1) (0) - w {2) (3 {2] (0 - A/3 (1) (0)) . (8) 

An important qualitative aspect of the two-lens ray-tracing equation is the following. In the 
limit where A — » 0, the ray-tracing equation adds two contributions, (3^(0) and (3^(0), each 
deriving from a potential, so that the related Jacobian matrix A = 80 s / 80 is symmetric. In turn, 
when A / 0, the Jacobian matrix is no longer guaranteed to be symmetric (in fact, the product 
of two symmetric matrices is not necessarily symmetric). The limit A — > occurs when the two 
deflectors are very close to each other. 

In the weak lensing limit the Jacobian matrix is considered to be close to the identity matrix, 
A = Id + 0(e), with the parameter e measuring the strength of the lens. Equation (|]) readily 
shows that for two lenses of comparable strength one expects a very small asymmetry in the 
Jacobian matrix, with A\ 2 — A 2 \ = 0(Ae 2 ). In the following we will find that, in spite of the 
smallness of the effect involved, a weak lensing analysis, carried out to second order, can provide 
interesting indications as to the possibility of detecting, in realistic cases, the presence of a double 
lens through its signature of an asymmetry in the Jacobian matrix. Simulations will show that 
such indications are further encouraged when the lenses involved are relatively strong. 



2.2. Local statistical analysis 

Consider a large number iV of extended sources all subject to the same Jacobian matrix A for 
the relevant ray-tracing. In general, as noted above, A ^ A T . Let Qij be the measured quadrupole 
moments of the individual galaxies. These observed quantities are related to the source (unlensed) 
quadrupole moments Qf - by the equation (see Schneider, Ehlers, & Falco 1992; see also Paper I) 

Q s = AQA T . (9) 

In the limit iV — > oo, the mean value of the source quadrupoles for an isotropic population of 
source galaxies should be proportional to the identity matrix 

(Q s )=MId, (10) 

with M a positive constant. This is the starting point of the Q-method described in Paper I and 
allows us to invert the relation for the mean values 



A(Q)A T = (Q s ) = Mid , 



(11) 
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into 

AA T = M((Q))~ 1 . (12) 

Note that the matrix (^4 T ) is symmetric. Here we have taken (Q) to be non-singular. 

The process of mass reconstruction makes use of the departures from the identity matrix of 
the ray tracing matrix as brought out by the observed quadrupoles. In practice, the true Jacobian 
matrix A is considered to be unknown. Equation ( jl2| ) shows that if A is a solution, any matrix 
A = UA obtained by multiplication by an orthogonal matrix (so that UU T = Id) is also a 
solution. Thus in the determination of the shear associated with gravitational lensing the Jacobian 
matrix is bound to be identified only up to an orthogonal matrix. It is easy to show that any 
solution A for a problem for which A is a solution can be written as A = UA , with U a suitable 
orthogonal matrix. 

The same set of data {Q^ } can be analyzed by a standard single lens procedure, leading to 
the determination of a symmetric Jacobian matrix A B , with the property A s = Aj. Then A s is to 
be related to the true Jacobian matrix A by means of an orthogonal matrix. In fact, for a given 
Aq, there are in general four different symmetric matrices available (corresponding to matrices 
that differ by the sign of their eigenvalues). If we restrict the attention to transformations with 
positive determinant (the det U — ±1 ambiguity reflects the well-known g i— > 1/g* invariance; 
see Schneider & Seitz 1995), we can write 

. / cost sinr\ 

A= [ . Mo , (13) 



— sin t cos r 



with 



£kk'A okk > , 1AS 
tanr = . (14) 

Here is the totally antisymmetric tensor of rank 2 and A kk = Tr(v4 ) is the trace of A (we use 
the summation convention on repeated indices). Note that the quantity £kk'Aokk' — ^012 — ^021 is 
a measure of the asymmetry of A . 

The symmetric matrix can thus be given in explicit form: 

^ = ±i(A ^ 0mm + ^^, fcfc ,^), (15) 

with 

f 2 = (Ao mm ) 2 + (ekk'Aokk 1 ) 2 ■ (16) 



-8- 



In what has been described so far, A is considered to be the same for all the source galaxies. 
Therefore at this stage we have addressed only the problem of a local analysis, applicable to 
source galaxies very close to each other, in a small patch of the sky, and at a similar redshift. In 
conclusion, we have so far shown that a local measurement cannot lead to discovering a double 
lens, because there is always a way to interpret the data by means of a symmetric ray-tracing 
Jacobian matrix. 



3. Non-local analysis in the weak lensing limit 

Let us first consider only sources located at a given redshift z. In the weak lensing limit, the 
Jacobian matrix for such sources is of the form A Q = Id + 0(e), and the asymmetry is small, 
Skk'Aokk' = 0(e 2 ). Thus we can write the related symmetric matrix (the "effective" Jacobian 
matrix) as 

V 72 <? ~ 7i/ \^02i - r A Q2 2 J 



with (see Eq. Q) 



r~- l -{A m2 -A mi ) = 0{e 2 ). (18) 



Therefore the elements of the symmetric matrix (appearing in the mass reconstruction analysis) 
can be expressed in terms of those of the true Jacobian matrix, as 

a = ^Tr(A ), (19) 

71 = ^(^011-^022), (20) 

72 = ^012 + ^021). (21) 

From inspection of Eqs. ( JT8| ) and ( |T9| ) and from the definition of the true Jacobian matrix, we find 
that the true source positions are related to a and to r by 

V • 9 s = Tr(A ) = 2a , (22) 
WA0 s = -(A 012 -A 021 ) = 2t. (23) 

Thus, if we refer to the vector field 11(6) commonly used in weak lensing analyses, we find 

/7i,i + 72^ 1 

u 



V 2 s . (24) 

72,1 - 71,2/ 2 
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If we now combine the definition of u{0) with the above relations for o and r we get 

V • u = -VV , (25) 
VA« = -V 2 r . (26) 

The last two equations replace the well-known relations applicable to single lens analyses, i.e. 
u = — Vcr and V A u = 0. 

Suppose now that the source galaxies follow a redshift distribution p(z). At each redshift, we 
can apply Eqs. (jT7|-|2~6l) provided that all lensing quantities, such as A , a, 7, and r, are calculated 
for each source at the correct redshift. In the weak lensing limit, the shear 7 is estimated from the 
ellipticities of galaxies that are observed in a particular region of the sky. Since source galaxies 
have different redshifts, in reality a mean value of the shear is measured: 



(7>,= / l(z)p(z)dz, (27) 
Jo 

where 7(2) is the redshift-dependent shear. In the weak lensing limit, all quantities depend 



linearly on the shear, and thus Eqs. (|25J) and ( [26D can be written as 

V-(u) z = -V 2 (a) Zj (28) 
V A(u} z = -V 2 (r) z . (29) 

Unfortunately, these relations cannot be used to infer directly, from a given set of data, the 
amount of asymmetry that would be a characteristic signature of multiple lensing. The reason is 
that for this purpose the data should be able to identify the vector field u(0) up to 0(e 2 ), because 
we know that r = 0(e 2 ). In practice, it is well-known that standard weak lensing analyses lead to 
the determination of u(6) only up to 0(e). In fact, observations lead to the determination of the 
reduced shear g{6) = r y(6)/a(0); the identification of g with 7 is correct only to 0(e). Thus the 
above analysis is not yet ready for practical applications. For this reason we need to extend the 
discussion so as to include the second order terms in the weak lensing expansion. In principle, the 
discussion can be carried out by retaining the redshift dependence of sources. In practice, such 
approach would lead us far beyond the original purpose of this paper. For this reason we will 
suppose, in the following, that all sources are located at redshift z s (in Sect. 5 we will consider 
again the spread of sources in redshift for the single lens case). 

For a population of source galaxies located at a single redshift (even in the case of strong 
lensing; but in this section, we recall, we are still within the weak lensing expansion) an 
"observable" field is (see Kaiser 1995) 

a= 1 + * «» Wai + feA (30) 

1- M V 92 1 - 9iJ \92,i ~ 91,2/ 
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for a strong single lens, such field has the important property that u = V [ln| 1 — k{0) |] , with k(0) 
the dimensionless projected density that one aims at reconstructing. Curiously, to second order in 
the weak lensing expansion (see Appendix A), it is possible to show that, for multiple weak lenses 
of comparable strength, 

V u = V 2 (\na) , (31) 
V A u = V 2 r . (32) 

In other words, we have derived a set of equations for which the contribution of a small asymmetry 
present in the true Jacobian matrix is retained consistently; the structure is similar to that of the 
set for u{6) field presented in Eqs. ([25]) and (|26|). The new set ofEqs. ( |3T| ) and ( j32| ) is the basis 



that allows us to generalize the procedure to extract the available information, contained in the 
observed shear field, from the case of a single lens to that of a multiple lens and thus to investigate 
quantitatively the characteristics of the coupling of two or more deflectors located along the same 
line of sight. The symmetric limit, which we may call in this context the single lens limit, is easily 
recognized. 

The results of this section (Eqs. ( PH ) and (|32|)) are not generalized easily to the case of 
sources distributed according to a p(z). However, the simulations described in Sect. 6.2 will 
basically support a description analogous to that given by Eqs. ( |2~8| ) and (|29|). 

The conclusions of this section, that the data contained in the shear map can be used to detect 
an asymmetry in the true Jacobian matrix A , and of the previous Sect. 2.2, that such a detection 
is impossible if based on a local measurement only, do not depend on the number of deflectors 
involved in a multiple lens. In the following, we will specialize our conclusions to the study of 
double lenses, for which the ray-tracing equation can be handled in a straightforward manner. 



4. Double lenses in the astrophysical context 

In this section we formulate some questions that may be interesting from the astrophysical 
point of view. The expected size of some effects and the possibility of an actual measurement will 
be addressed in separate sections at the end of the paper. 



4.1. Where to look for the signature of double lensing? 

The prime signature of double lensing would be the detection of a significant asymmetry r 
(see Eq. (E2I)). Because of statistical errors, the measured vector field u is bound to be associated 
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with a non-vanishing curl. Therefore, a positive detection of asymmetry can be claimed only if the 
expected statistical error on r is smaller than the true value r characterizing the double lens. 

A quantitative analysis of this condition will be provided in Sect. 5 below. Here we only note 
that, in general, for two clusters of comparable strength, at different distances, with offset centers 
with respect to the line of sight, the regions on the sky where the signal-to-noise ratio for r should 
be largest are those, on either side, just off the line connecting the cluster centers (see Fig. |3] and 
Fig. 0) 

As we have seen in Sect. 2, the asymmetry is expected to be weak (C(Ae 2 ) for two clusters 
of comparable strength e). The optimal conditions for detecting double lensing are then: 

• The geometric parameter A should be close to unity, i.e., for a given distribution of sources, 
the distance between the two clusters should not be much smaller than the distance from 
the observer to the near cluster. This condition is reasonably satisfied if, for example, the 
redshifts of the two clusters are in the following relation, z^ ~ Az^\ 

• The two clusters should be not too weak. It is preferable to consider cases where the 
dimensionless densities, kP^ and k^, are of order unity. 

• The cluster centers should be offset, but there should be a region of significant overlap 
between the two clusters in the sky. 



4.2. Probability of cluster alignments 

Given the above criteria, we now estimate the probability that a double lens with desired 
properties be observed. For the purpose, we refer to H = 65 km s _1 Mpc -1 for the Hubble 
constant, within a Friedmann-Lemaitre cosmological model characterized by Q = 0.3 and 
tt A = 0.7. 

First, we suppose that a nearby cluster is observed at redshift « 0.1 (the values of the 
various quantities used here are similar to the values used in the simulations to be described in 
Sect. ||) and ask what is the probability of finding a cluster at redshift z^ m Az^ well aligned 
with the first cluster; in considering this question, we further require that the second cluster be 
sufficiently massive, with mass greater than 5 x 10 14 M . For the purpose, we assume that 
the angular distance between the centers of the two clusters be between 2 and 4 arcmin and 
that the redshift of the second cluster be in the range 0.3 < z^ 2 > < 0.5. A simple calculation 
then shows that the center of the second cluster must be inside a comoving volume with size 
V ~ 10 360 Mpc 3 . Therefore, based on the cluster density found by Borgani et al. (1999; see 
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also Girardi et al. 1998), the expected probability of finding a "good" double lens turns out to be 
~ 0.3%. If we consider less strict requirements, with an angular distance between the two cluster 
centers in the range 1-5 arcmin and a redshift range 0.2-0.6 for the second cluster, the estimated 
probability increases up to « 1%. 

These estimates should be combined with the number of massive clusters at small redshift 
(for the object observed at z^) that should be available over the whole sky. The comoving volume 
for a shell with redshift between 0.05 and 0.15 is ~ 1.34 x 10 9 Mpc 3 ; correspondingly, the total 
number of clusters with mass above 5 x 1O 14 M in such a volume is about 350. As a result, we 
expect from 1 to 3 cases of double lenses with detectable effects and configurations similar to the 
one considered in our simulations. From the way we have approached the problem, it is clear that 
the estimate just obtained provides only a lower limit with respect to the number of good cases 
expected for the purposes of the present paper. 



4.3. Mass reconstruction for double lenses 

Suppose that a set of data leads to the determination of u and that, based on suitable boundary 
conditions (e.g., r(0) — > 0, cr(0) — > 1 for ||0|| — > oo), we integrate Eqs. ( |3~1"| ) and ( |B~2~| ) and get the 



functions er(0) and r(0). For a single lens problem the lensing analysis, at this stage, would be 
complete, since we would have the dimensionless projected mass distribution k(0) = 1 — cr(0) 
associated with the deflector; knowledge of the critical density S c would lead to the projected 
mass distribution £(0). In the case of a double lens, what would be the astrophysically interesting 
quantities that could be derived? 

For the following discussion, the ray-tracing equation can be re-cast in a more transparent 
form in the case z = z^ s ' (so that = = 1). If we introduce the two curl-free lensing maps 
(we recall that V A f3 {e) = 0) 

(0) = 0-A/3 (1) (0) , (33) 
C (2) (6>) = 0- A/3 (2) (0) , (34) 

and the effective ray-tracing map 

C{6) = (1 - A)0 + A0 S (0) , (35) 
we see that Eq. dSh can be written as 

C(0) = C (2) (C (1) (0)) • (36) 

In other words, ray-tracing for a double lens can be seen as the composition of two curl-free 
functions. 
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The double lens problem in its direct formulation involves two (dimensionless) ray-tracing 
functions (/3 {1) (0) and /3 (2) (0)), two critical densities and E< 2) ), and one additional 
geometric parameter A. Any inverse problem would be under-determined, if we start from the 
two dimensionless functions a(0) and r(0) alone. Even if the geometry is taken to be known (for 
example, when the distances to the two clusters acting as deflectors are known), it is not possible 
from the pair (a, r) to "reconstruct the two lenses," i.e. to get the ray-tracing functions (3^ and 
(3^ separately. This statement is easy to understand, especially if we refer to the curl-free lensing 
maps and £( 2 ' of Sect. 2.1. In fact, a simple scaling invariance C,^ h- > kC^\ l— *■ k^ 1 ^ 2 ^ 
shows that, unless we are able to provide suitable boundary conditions on the functions £^(0) 
(e.g., C (^) — *■ 1 f° r large ||0||X there is no way to disentangle from the shear data alone the 
contributions of the two lenses. Obviously, this does not mean that the lensing analysis is useless; 
it is only a reminder that in this more complex situation an unambiguous mass reconstruction 
based on weak lensing would require additional input from other probes of mass distributions 
(e.g., X-ray data). 

In order to clarify this point further, it may be instructive to consider the following simple 
examples, all characterized by r = 0. 



4.3.1. Aligned, centrally symmetric lenses 

Suppose that 

C(0) = ^|C s (l|0||) , (37) 

with ( s (9) a real continuous function such that C s (0) = 0. Then it is natural to choose £0) and C^ 2 -* 
to be radial, i.e. 

C (1) (0) = j^C (1) (l|0|l)> (38) 

C p0 W = p|C W (lW > 09) 

where and are real functions. Supposing that C^(0) > for all 9, we can rewrite Eq. d36t ) 
as 

C (2) (C (1) (0))=C s (0)- (40) 

Thus reduced to a one-dimensional problem, it is clear that this equation in general admits an 
infinite number of solutions. 
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4.3.2. The general case of double lenses without asymmetry (r = 0) 

Consider an effective ray-tracing map (see Eq. (|35|)) such that V A £ s = V A 6 s = 0. In this 
case we can solve Eq. ( [36| ) by choosing an invertible curl-free map t^ 2 - 1 and by writing 

C (1) (0) = « {2) r\C(e)) ■ (4i) 

We now require that Q 1 ' be curl-free. From the above equation, this happens if and only if the 
product of the Jacobian matrix A^~ 2 \ associated with (C^) » an d ^4 S , associated with £ s , is a 
symmetric matrix, i.e. 

A { ~ 2) A S = (A { - 2) A S ) T = A S A { ~ 2) . (42) 

From linear algebra we know that this is equivalent to saying that A^ 2 ^ and A s have the same 
eigenvectors. Since C^ 2 -* is curl-free, let us introduce the potential <f)^ so that £^ = and 
= If £ = £ 2 ) is a local system of coordinates where A s is diagonal, then Eq. ( |4"2] ) 
is satisfied if A^ is diagonal in the same system of coordinates, i.e. if d 2 (p^ / <9£i<9£ 2 = 0. 
This simply states that (p^ is separable in £i and £ 2 , i-e. that we can write as the sum of an 
arbitrary function of £i and of an arbitrary function of £ 2 - This again demonstrates the freedom 
at our disposal in solving Eq. (06|). In addition, one can now better appreciate why, even when 
natural boundary conditions are specified, in general there is no guarantee that the solution be 
determined uniquely; in fact, several solutions are expected when the coordinate system £2) is 
associated with poles (such as the point r = for polar coordinates). 



4.3.3. Two lenses with no net lensing? 

One curious case that may be imagined is the possibility of combining two lenses with no 
net effect, so that = ( see Eq. (|35l)). In principle, one may argue that from any invertible 

curl-free map £^(0) one can take a second lens characterized by C^(^) = (C^W) • This 
idea would obviously generate an infinite number of solutions since the starting function C} 1 ^ 
is at our disposal. However, if we require that the density distributions associated with the two 
lenses be both positive definite, i.e. that V • < 2, it can be shown that there is no way for two 
lenses to exactly compensate for each other: in other words, the only admissible solution is the 
trivial C (0) — C (&) = 0- This conclusion is consistent with the theorem that states that any 
combination of gravitational lenses is bound to produce a net magnification (/i > 1; see Seitz & 
Schneider 1992). 
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4.4. The dark cluster problem 

A different problem based on double lenses can be formulated in the following way. Suppose 
that one observes a lensing cluster and that in the process of producing the mass reconstruction 
one finds evidence that the vector field u is not curl-free, well above the expected errors. Clearly, 
in such a situation, one possibility is that a separate mass concentration, which we may call a dark 
cluster, is responsible for the effect. We may now consider the case when the mass distribution of 
the visible cluster is well constrained by diagnostics independent of lensing (e.g., by X-ray data). 
Under these circumstances, what can we tell about the dark cluster properties from the observed 
lensing effects? In particular, is it possible to derive the location and the mass distribution of the 
invoked dark cluster? 

If the mass distribution of the luminous cluster is taken to be known, one has either the 
function £^ or the function depending on whether the dark cluster is near or far; in either 
case, the deflection associated with the dark cluster (3^ = (6 — £^)/A depends implicitly (see 



Eqs. (p3|)-(p5|)) on the geometric parameter A, which is unknown. One way to obtain the value of 



A is to impose that the lensing map associated with the dark cluster is curl-free, i.e. that either 



V A 

or that 

V A 



c s A((ciT>)) 



ci 2) ((c s A r>)) 



, (43) 



, (44) 



depending on whether we guess the dark cluster to be near or far. In Sect. 6 below, by means of a 
simulated case, we will show how A can be determined with reasonable accuracy, by minimizing 
the square of the left side of the above equations. In other words, under the above circumstances 
we will demonstrate that a weak lensing analysis allows us to pinpoint the location of the invisible 
cluster and to reconstruct its mass distribution. 

In closing this section, we may note that the problem of a single dark cluster, i.e. the case of 
a matter concentration detected by lensing effects without a visible counterpart for the lens, is less 
constrained and, in this respect, less interesting than the case discussed above in the sense that the 
usual mass reconstruction would only lead to the dimensionless projected density k, with no hope 
to derive the distance to the dark cluster that is invoked and the actual scale of the mass involved. 



5. Size of the double lensing effect 



In this section we calculate the expected order of magnitude for the measurable effects 
associated with double lensing. For simplicity, the following section discusses the noise properties 
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of quantities related to the vector u. The results obtained are the leading order estimates for 
the desired quantities related to u. The following subsection deals with a population of source 
galaxies located at a single redshift. Effects related to a spread in redshift will be estimated in 
Sect. 5.3. 



5.1. Expected variance of V A u for a single lens 

Consider a single lens, characterized by true Jacobian matrix A , so that the shear field u 
has vanishing curl, i.e. r = 0. Because of the finite number of source galaxies used and of the 
smoothing introduced in the reconstruction process, we expect that the measured u in general 
differs from the true field u . 

In order to calculate the expectation value and the variance of V A u, we may use a technique 
similar to that described in Paper II. We first recall that in the reconstruction process Eq. ( |l"2| ) is 
used together with a (positive) weight function W(0, 6'). Such weight function, defined so that 
W(0, 6') is significantly different from zero only for 0' close to 6, enters in the local averages of 
quadrupoles (or ellipticities). In particular, if we resort to the so-called Q-method associated with 
Eq. ( O ) (see Paper I), the local average quadrupole (Q)(0) is calculated from the relation 

(Q)(0) = ^ ^ f«— . (45) 

Here the superscripts (n) label the galaxies affected by lensing. In Paper II we have calculated 
expectation values and variances of several quantities, under the hypothesis that the weight 
function is invariant upon translation, so that W(6, 6') = W(6 — 6'), and that it is normalized, i.e. 

p J W{0) d 2 9 = l. (46) 

Here p is the density of galaxies (taken to be constant over the field). In particular, we have shown 
that the expectation values of the relevant quantities (in particular, the measured shear, the field 
u, and the reconstructed mass density k) are equal to the true quantities smoothed by the weight 
function, e.g. 

(u)(0) = p J W{0 - 0')u o (0') d 2 9' . (47) 



From this expression it can be easily shown that the expected value for the measured curl of u 
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vanishes. In fact, 



(V A u){0) = pV e A j W{0 - e')u o (0') d 2 9' 

= p J W{0')V e A u o (0 - 0') d 2 9' = . 



(48) 



Here we have used the symmetry property of convolutions. Thus, for a single lens, we expect 
to measure (in the average) a field u with vanishing curl. Note that, for a double lens, a similar 
calculation would show that 

(r){0) = pJ\V(0- 0')t o {0') d 2 9' . (49) 

Let us now turn to covariances. Using the same notation as in Paper III (see also 
Lombardi 2000), we call c the covariance of the ellipticity distribution for the population of source 
galaxies ((efep = cdij). There we have shown that the covariance of the shear 7 is 

Cov 4J ( 7 ;0,0') = (( 7i (0) - dm)^') ~ (7;>(0'))> 

= cp5 iS J W(0 - 0")W{0' - 0")d 2 6" . (50) 

Note that the covariance of 7, which we may write in short as Cov( 7 ), depends only on — 0' . 
Then the covariance of u is related to the covariance of 7 in a simple way: 

Cov(m) = -V 2 Cov( 7 ) . (51) 

Similarly, it can be shown that 

Cov(V A u) = -V 2 Cov(m) = V 2 V 2 Cov( 7 ) . (52) 

The last expression is the basis of the following discussion. In fact, we may argue that a lens 
can be considered to be double only if the measured value of V A u is significantly larger than the 
expected error on this quantity. If we refer to Eqs. (jTS|), ( |T9"| ) as definitions of a and r, we thus find 
(under optimal conditions defined in Paper II) 

Cov(a) = Cov(r) = Cov( 7 ) . (53) 

A more useful expression can be found in the simple case of a Gaussian weight function: 
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In this case we have 



Cov ij (7; 6) 



Cov(V A u; d) 



c5 



1.1 



A-npa 



2 exp 
w 



(^ w -\\s\\ 2 )' 



32a w 



x exp 



(55) 



x 



(56) 



where we have called 8 = 6 — 6'. Note that VAm shows anti-correlation for ||<5|| in the range 



aw Similar noise properties are associated with the quantity V • u. 



5.2. Condition for the detection of double lensing 

Now we are finally able to set a quantitative condition for the detection of double lensing. 



The requirement that the error on r (which can be estimated from Eqs. ( p3[ ) and (|55|)) be smaller 
than the expected value of r, within factors of order unity, can be written as: 



AkVkW*J-^. (57) 



Note that payy represents the number of galaxies effectively used in the average of Eq. (|3j). 
If we refer to a case with kS 1 ' ~ ~ A ~ 1/2 and to the currently "realistic" density 
p ~ 100 gal arcmin~ 2 , with c ~ 0.03, we thus find that a significant signal requires the use of a 
Gaussian weight function with smoothing size aw not smaller that 10". On the other hand, the 
interesting effects are those associated with the structure of the field r(6), which is associated 
with a length scale at best comparable with that of the clusters under investigation. Therefore, 
we may argue that the most promising way to identify the effects of double lensing is to work 
with the smallest smoothing size a w compatible with condition (|57|). These arguments and 
the applicability of the condition defined by Eq. ( |57| ) are clarified and further supported by the 
simulations that will be presented in Sect. 6. 



5.3. Effects related to the spread in the distances of the source galaxies 

So far, in this section, we have referred to a sheet of source galaxies at a given distance. In 
real situations where the source galaxies are distributed at different redshifts, we expect three 
separate effects that can have a significant impact on our analysis: 
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• An extra source of noise, related to the distribution in redshift of the individual sources, is 
added to our data. As a result, the measured shear will have a larger covariance than the one 
calculated in Eq. (|5T]), and thus a larger covariance is also expected for VAm. 

• An additional effect is induced by the extra-clumping of the sources described by the 
cosmological two-point correlation function (see Peebles 1980). 

• Outside the limit of very weak lensing, there is actually no guarantee that V A u = even 
in the standard case of a single lens. This happens because the observable is, in this case, 
(g) z , which is not simply related to k and 70 . 

The first two items contribute to the estimate of the relevant covariances while the third item adds 
a bias (to be discussed in Sect. 5.3.2). 



5.3.1. Extra contributions to the covariance 

Below we will estimate the covariance of the measured shear 7 when the source galaxies are 
located at different (unknown) redshifts. For the purpose, we assume that the redshift distribution 
of galaxies is known. In particular, we write the probability to find one galaxy in an area d 2 9 of 
the sky with redshift between z and z + dz as 

P 1 = pp( z )d 2 6dz . (58) 

Here we model the galaxy angular distribution in terms of a constant galaxy density p on the lens 
plane. The probability to find two galaxies, one in a patch d 2 ^ 1 ) of the sky with redshift in the 
range \zP-\ z^ + dz^] , and one in a patch d 2 8^ with redshift in the range [z@\ + dz^] , 
can be written as 

P 2 = [p 2 p(z^) P (z^) d 2 0« d*W d 2#(2) dz (2)] + e(r) ] (59) 

Here £(r) is the two-point correlation function of galaxies at mutual (3D) distance r. For distances 
smaller than 10 h^ 1 Mpc, this function can be well approximated as a power law (see Peebles 
1993) 

fW =p^) L7 \ (60) 
where h = H Q / (100 km s _1 Mpc -1 ) is the reduced Hubble constant. 
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We will now evaluate Cov(7) in the weak lensing limit. In this approximation, the observed 
ellipticities are related to the unlensed ones by the relation 

e i = e s i - loi (d)w(z) , (61) 

and thus the expected covariance on e is given by 

Covy(e) = Covy (e s ) + Vax(w)j Oi {e)j Oj (0) = c&a + Vax(w)j O i(O)j Oj (0) ■ (62) 

A plot of Var(u>) as a function of the lens redshift for a "typical" redshift distribution p(z) (see 
Eq. ( [73] ) below) is shown in Fig. |T3|. 

A simple estimator of the shear is given by (see Seitz & Schneider 1997) 

7 (0) = -— Ve^(0,0W) . (63) 

Since the weight function W is assumed to be normalized following Eq. (|46l), this is an unbiased 
estimator. In fact (for the procedure of "spatial averaging" see Paper II) 

1 N r 

<7>(0) = -7-r $>S n) >^(0, * p / W(0, 0') lQ {0') d 2 6' . (64) 
[w)z n=1 Jn 

Thus we recover one simple result already discussed in Paper II: the measured shear is the 
smoothing of the true shear with the spatial weight function W. 

Turning to the covariance of 7, we find 

(65) 



Inserting here Eq. (|BT]), we find a long expression composed of three types of terms: terms of 
the form (e^ n ^- ), terms of the form (e^), and terms independent of e s . Because of the 
isotropy hypothesis, the first contribution vanishes unless n = m and i = j, and the second always 
vanishes. Thus we finally find 

c 0Vii ( 7 ; 0, 0') = -^-cdij J2 w ( d > e (n) )w(o', oW) + 

\ I z n 

W$>(^) - (w) z ] l0t (o^)w(o,e^) x (66) 



HI 

[w(z™) - (w) z ] l0j {6^)W{e',6^) ) . 
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This equation is composed of two terms. The first is related to the intrinsic spread of source 
ellipticities, and in fact it is proportional to c = (Je s | 2 )/2. The second term (second and third 
lines in Eq. (|5o])) is instead related to the spread in redshift of galaxies. In order to obtain a more 
general expression, we adopt a technique already used in Paper II and average the expression for 
Cov(7) over the source positions (0^ } and redshifts [z^ } . As a result, the first term becomes 
simply 

ri = j^c&n Y J w{e ) e^)w(e', e^) .— j^cS ijP [ w(o, o")w(o', o") dV . (67) 

\ w )z ~ \ w /z Jn 

This term has already been studied in detail in Paper II. 



The second term of Eq. ( pq ) deserves a more detailed discussion. In order to carry out 
the average over the source redshifts, we need to split the sum over n and m in two sums, one 
involving the terms for which n = m, and the other involving the terms n ^ m. For the first terms 
we have 

^ = ^(EM^) - M,] 2 7o,(^ ( " ) )7o J (^ ) )W(^^ ( " ) )^(^^ ( " ) ) 

^-l^VarH / l0l (e") l0j (e")w(e,e")w(e',e")d 2 9" . m) 

\ w )z Jn 

Note that this term is related to single galaxies, and thus does not involve the correlation function 
£(r). On the other hand, for pairs of galaxies (i.e. if n ^ m in the last term of Eq. (|66|)) we find a 
rather complicated expression 



(w) 2 



P 2 
z JO 



p(z")dz" / d 2 6" / v {z"')dz"' \ d 2 6'"[w(z") - (w) z ]yoi(9")W(0,0") x 
Jn Jo Jn 

[w(z">) - (w) z \ loj {e"')W{0',0"')t,{r) ■ (69) 



In this expression r is the distance between two galaxies at redshifts z" and z'" and angular 
positions 6" and 6"' . 



The covariance of 7 is the sum of the terms in Eqs. (|67|-|69|), each of which, as discussed 
above, arises from a different source of noise. In summary, 



The contribution of Eq. ( pTD is related to the spread in the intrinsic ellipticities of source 
galaxies (parameterized by c). It is proportional to 1/p (see the normalization condition 
(P6])) and has a correlation length proportional to the correlation length of W. Note that this 
term is proportional to so that no correlation between 7! and 72 arises. 
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• The contribution of Eq. ( |6^ ) is related to the spread in redshift of the sources (parameterized 
by Var(w)). It is proportional to 1/p, has the same correlation length of the previous term, 
and, in addition, is proportional to 7q. This term is not diagonal. 

• The contribution of Eq. ( p9| ) is due to the correlation between galaxies (described by the 
two-point correlation function £(r)). It is independent of the density of galaxies (provided 
£(r) does not depend on p; see comments by Peebles 1993 after Eq. (19.39)), has a 
correlation length somewhat longer than that of the other terms and is proportional to 7q. 
This term is not diagonal. 

A complete description of the effect of the first term has already been provided in Paper II. Its 
effect for the detection of a double lens was analyzed in Sect. |57T| . 

The order of magnitude of the ratio between the term of Eq. ( |68| ) and the one of Eq. ( |67| ) can 
be argued to be 

r, ^ VarH 7o 2 Q 
Fx c 

Typical values for c are c ~ 0.02, and typical values for Var(w) can be estimated from Fig. [T3|. 
Thus, if we refer to a single lens at redshift = 0.2 with a shear of the order of 7 ~ 0.3, we find 
T 2 /Ti ~ 0.2. Note that, since calculations here are performed in the weak lensing approximation, 
the shear cannot be much larger than 0.3, and thus the ratio obtained should be taken as an 
(approximate) upper limit. In any case, the effects of this term will be included in the simulations 
that will be described in Sect. 6.2. 

As to the third term T 3 , it is clearly very complex, both in relation to its mathematical 
structure and to the physical ingredients involved. Therefore, instead of attempting here an order 
of magnitude estimate for it, we prefer to postpone a complete discussion of its impact to a future 
paper that will make use of suitable simulations in the cosmological context. 



5.3.2. Bias on the reduced shear 

Another effect encountered when source galaxies are "placed" at different redshifts is a bias 
on V A it, which is not bound to vanish any more. This effect could, in principle, be misinterpreted 
as the signature of a double lens. Do we have to worry about this contribution? Note that if this 
effect turned out to be significant, it would have an important impact on all single lens mass 
reconstruction methods, because these are all based on the curl-free character of the shear induced 
by gravitational lensing. 
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In order to be more specific about the origin of this effect, we start from some well-known 
results (Seitz & Schneider 1997; see also Paper III). Let us consider a sample of galaxies 
with redshift distribution p(z), behind a single cluster at redshift z d . As the critical density 
E c (z) depends on the source redshift, the lens acts with differential strength on the lensed 
galaxies (see Sect. 2.1). The cosmological weight function w(z) controls the ray-tracing 
transformation. In particular, the (redshift dependent) reduced shear takes on the form 
g(0,z) = w(z) r Y(0)/\l — w(z)k(6)\. The fact that w(z) enters the expression of g(6,z) 
nonlinearly complicates the lensing analysis. Note that even though the relevant ray-tracing 
transformation for a single lens is symmetric (e.g., see Eq. (9) of Paper III), because of the spread 
in redshift distribution the arguments generally used to identify a curl-free field u do not hold any 
more. In fact, if we observe a single lens and just ignore the redshift distribution of the sources, we 
would naively start by identifying the observed average ellipticity with the reduced shear, using 
the relation (e) = —g. This mistake would lead in general to a quantity u, defined by Eq. (]3(J), 
with non vanishing curl. 

For subcritical lenses Seitz & Schneider (1997) have shown that an accurate, but 
approximate, lensing analysis can be carried out as if all galaxies were at a single redshift, 
provided the reduced shear g{0) in Eq. ( p0| ) is evaluated from the average ellipticity following the 
relation (e) (w 2 ) z /(w)\ = —g. Correspondingly, the dimensionless density distribution obtained 
from the lens reconstruction changes by a factor (w 2 ) z / (w) z . If the above corrected relation 
between average ellipticity and reduced shear entering the definiton of u is kept into account, 
then, in this approximate description, the "spurious curl" (that would be associated with the naive 
use of (e) = —g) is eliminated. 

In order to evaluate the magnitude of the effects (described in this subsection) associated 
with the spread in the distances of the source galaxies, we have carried out simulations that will 
be described in the next section. Here we summarize the main results. Using a reasonable redshift 
distribution for the sources, and for a wide range of redshifts z& of the single deflector, we find an 
extremely small value of V A u. For example, for a nearly critical single lens at redshift z<j = 0.3 
the value of the relevant curl is about 50 times smaller than the value associated with the double 
lens effect addressed in this paper. The nearly critical condition adopted in this example certainly 
overemphasizes the signal, with respect to the case of weaker lenses, and is unfavorable to the 
approximation suggested by Seitz & Schneider. Still, if we apply the simple correction factor 
(w 2 ) z / (w) 2 . described above, we gain an extra factor of 4. We can thus state that the effect of 
the redshift distribution discussed in this subsection, in the example considered, is contained to 
be about 200 times smaller than the effect of double lensing we are interested in. From this we 
conclude that the simple arguments and derivations provided in the main part of the paper are not 
affected by the spread in the distances of the source galaxies significantly. 



6. Simulations 



The main goal of this section is to check the analytical framework developed in the paper and 
to provide convincing evidence that effects characteristic of double lensing are within reach of the 
observations. For this reason we follow two types of investigation: a semi-analytical approach 
(Sect. 6.1), where we test the applicability of our asymptotic analysis, and a study (Sect. 6.2) that 
may be seen as a numerical observation, where we simulate the distorsions of a set of source 
galaxies in a given field under realistic conditions (see below) and we perform the statistical 
lensing analysis on the observed ellipticities. For simplicity, the simulations are carried out under 
the assumption that no clustering is present in the source population (Eq. ( ]59| ) with £(r) = 0). 

6.1. Check of the analytical framework 

A first test is based on the following steps of a semi-analytical approach: 



The population of source galaxies is taken to be at a single redshift (z B = 2.0). 

A simple model of a double lens is chosen. For simplicity, two centrally-symmetric 
projected mass distributions are used, within a favorable geometric configuration (see 
Sect. 4.1). 

The two clusters are taken to be located at redshift z^ 1 ' =0.1 and = 0.4. Since the 
source population and the double lens we have in mind have such large distances, the values 
of the cosmological parameters and ^a need to be specified. In the following we refer to 
the case = 0.3, ^a = 0.7; moreover, we adopt H = 65 km s _1 Mpc -1 . 

The two functions /3 (1) (0) and /3 {2) (0) are calculated. Thus the ray-tracing function 6 s (6) 
is derived (see Eq. (Q)). 

The Jacobian matrix A o (0) is calculated from the map s (6). This allows us to calculate 
the various quantities defined in Eqs. (|1"8|-[2"1"|). In the following, these are taken to hold as 
definitions also for strong lenses. 

The "effective" Jacobian matrix A s (0) is obtained from A o (0) using Eq. (fT3|). 

From A s (8), the reduced shear g(6) and the field u(6) are calculated. 

The divergence and the curl of u are compared with the Laplacians of ln(er) and r (Eqs. ( |3T1 ) 
and (B2b). 
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Note that, by following this procedure, we are considering the "effective" Jacobian matrix A s 
without errors. We do so, because errors on A s are discussed separately (see next Subsection 6.2, 
and Paper I and II) and because we are focusing here mostly on a test of the adequacy of Eqs. ( J3T] ) 
and (fJ2|). Let us now describe some of the steps in further detail. 

The two clusters, taken to be located at redshift z^ 1 ' =0.1 and = 0.4 (so that the value of 
the parameter A defined in Eq. (|]) is A ~ 0.794 in the assumed cosmological model), have been 
modeled using the following profiles (close to isothermal profiles) for the projected density (see 
Schneider, Ehlers, & Falco 1992): 

with 9 = \\0\\. Here k c = k(0) is the maximum value of the projected density, and 9 C is a length 
scale (core radius). The deflection (3(6) for such profiles takes a particularly simple form: 

13(6) = - Kc 6 . (72) 

a/i + (0/0c) 2 

The intrinsic physical parameters of the two clusters, i.e. the core radius in Mpc and central 
density in kgm~ 2 (1 kgm~ 2 ~ 478Af Q pc~ 2 ) have been assumed to be equal. For simplicity, the 
first cluster is centered at 6 = (5', 3' 30") and the second cluster at 6 = (5', 6' 30"). In addition, 
we have considered two cases: a case of "weak" lenses, for which the second order expansion 
considered in Sect. 3 is expected to provide accurate results, and a case of "strong" lenses, with 
"joint mass density" 1 — a too high in relation to the asymptotic expansion. For the strong case 
the mass associated within our field of 10' x 10' for the far cluster is M ~ 3.14 x 10 15 M . The 
latter case will show that, at least qualitatively, the analysis developed in this paper is applicable 
even to relatively strong lenses. The cluster parameters adopted for the tests presented here are 
summarized in Table 1 . 

In Figs. ||and |3] we display the combined luminosity and dimensionless density distributions. 

The second step has been the calculation of the ray-tracing function 6 s (6). This function 
has been calculated on a grid of 100 x 100 points, representing a square with a side of 10'. 



Table 1: The parameters used for the simulations. 



component redshift density core radius 







z 




At c E c (kgm~ 2 ) 


9 C (angle) 


r c (Mpc) 


weak 


cl 1 


0.1 


0.054 


0.5 


2' 31" 


0.3 


#2 


0.4 


0.120 


0.5 


0' 52" 


0.3 


strong 


cl2 


0.1 


0.401 


3.7 


2 31" 


0.3 


#2 


0.4 


0.891 


3.7 


0'52" 


0.3 
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The Jacobian matrix has been calculated on the same grid using the approximate derivatives 
determined from the differences of s in neighboring points. 

The plot of the semi-trace a of A (see Eq. (|19|)) presents some interesting features. We first 
note that for weak lenses a = 1 — k^ 1 ' — k^. However, when the lens is strong, a curious effect is 
noted (cfr. Fig. |]): the maximum corresponding to the mass distribution of the second cluster is 
significantly smoothed and lowered. This has to do with the coupling of the two deflectors. The 
effect can be broadly described as a magnification of the mass distribution of the second deflector 
because of the lensing effect of the first cluster. In fact, while the value of + exceeds 
unity (see Fig. 0), the double lens remains sub-critical with 1 — a below unity. 

Proceeding along the list of steps indicated above, we have calculated the reduced shear g{0) 
and the field u(Q). At the end of the process, the two quantities V ■ u and V A it are available on 
a 100 x 100 grid. Note that all differentiations have been performed using a 3-point, Lagrangian 
interpolation. 

The results of our tests for the weak case are summarized in Figs. |5|-|7]. Within the numerical 
errors, we find that Eqs. ( PTj ) and (^) are verified. Of course, since in this case the magnitude of 
the effect is very small (the maximum value of r is approximately 2 x 10~ 4 ), we should not expect 
that double lensing of this type could be actually detected. In passing we note that the order of 
magnitude for the various quantities shown in the figures is consistent with this value of r. 

The tests for the strong case (see Figs. |8|-|10|) are rather surprising. In fact, Fig. |8| shows that 



Eq. ( [Hj ) holds approximately also in this case. However, Eq. ( TJ2] ) turns out to be inadequate, as 
shown by Fig. This behavior reflects the fact that the lens configuration here has a rather high 
value of 1 — a (its maximum is about 0.928), and a small value of r (maximum about 0.01). A 
heuristic way to correct Eq. (^2|) is the following: 

VA« = V- (—) . (73) 



The use of a in the denominator of the previous equation is suggested by analogy with Eq. (pit), 
where V(lner) = (Ver)/cr is used. Moreover, we note that, to second order, Eq. ([73]) is identical 
to Eq. (|3"2"|). The new equation turns out to be a better approximation than Eq. (Q), especially for 



regimes away from the strong and the weak limiting cases considered above; in such intermediate 
cases Eq. ( f73| ) performs better, typically by a factor of 2. 

Finally, we have considered the problem of the dark cluster (see Sect. 4.4). In this case the 
simplified simulations in the semi-analytical approach described above have been performed with 
the aim of demonstrating how the redshift of a dark cluster can be determined when the redshift 
and the mass distribution of the luminous cluster are known. In order to find the unknown redshift 
we have minimized S, the integral of the square of the left hand side of Eq. ( p3| ) or of Eq. (p4|), 
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depending on whether the dark cluster is considered to be closer to us than the luminous cluster 
or not. Simulations have been performed using the parameters of the strong lens configuration, 
by taking either the first or the second cluster as the dark cluster. Figure [TT] shows S for the case 
where the dark cluster is in the front, while Fig. [T2| shows the same quantity for the case where the 
dark cluster is behind. Note that S has a minimum (near or equal to 0) for the true redshift of the 
dark cluster. Note also that the second minimum of S occurs where the condition A = is met: 
in this case the quantities Q x \ Q 2 \ and £ s all reduce to 6. We should stress that if a dark cluster 
is detected from a double lens signature, as described in this paper, and if the key characteristics 
of the bright cluster are known from independent diagnostics, then the reconstruction method 
developed here leads to the full determination of both the distance and the mass distribution of the 
invisible cluster. 



6.2. More realistic simulations 

In order to ascertain the relevance of our analysis with respect to present and future 
observations, we have performed a set of simulations described in the following steps: 

• We generate N source galaxies. Each galaxy is described by its location in the sky, its 
redshift, and its source ellipticity. Locations are chosen following a uniform distribution 
on the observed locations 6 (thus neglecting the magnification effect). The redshift of 
source galaxies has been drawn from a gamma distribution used also by other authors (e.g., 
Brainerd et al. 1996) 

z 2 

v{z) = 7^3 exp[-(z/zo)] , (74) 

with zq = 1/3 (other values of z have also been tested). Finally, ellipticities e s are taken 
from a truncated Gaussian distribution (see Eq. (32) of Paper III) characterized by c = 0.02. 
This value is probably unrealistically large; it is adopted in order to test our framework 
under relatively unfavorable conditions. 

• As a model for the double lens we use the "strong" double lens model introduced in the 
previous subsection, within the same field of view of 10' x 10'. 

• Source ellipticities e s are transformed into observed ellipticities e using the true asymmetric 
Jacobian matrix associated with the lens. Note that the redshift of each galaxy has been 
taken into account when performing this transformation. 
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• The statistical lensing reconstruction procedure is applied to the observed ellipticities. The 
reduced shear map g(0) is derived by averaging observed ellipticities of nearby galaxies, 
from the relation (e) = —g properly corrected for the redshift distribution. 

• From the shear map we calculate the other relevant quantities: the vector field u{0), its 
divergence, and its curl. 

• We then look for significant peaks in the curl map, which would reveal the double nature of 
the lens. 



Several simulations have been performed with different source galaxy densities. The results 
obtained are very encouraging, since a significant signal in the the curl of u is already detected 
with a galaxy density of 100 gal arcmin -2 . Such relatively high density has already been reached 
by current observations. For example, new observations with the Very Large Telescope of the 
cluster MS 1008. 1-1224 come close to it (Lombardi et al. 2000, Athreya et al. 2000; see also 
Hoekstra et al. 2000 for observations of MS 1054 with the HST). 



Figure [T4| shows a typical result obtained in our simulations. The figure shows the density 
plot of the observed V A u and, superimposed, the contour plot of V A u , i.e. the true quantity 
that we should measure in the limit of infinite galaxy density. For both plots we have used the 
same smoothing spatial weight function, a Gaussian of width <7\y = 30" (see Sect. 5.2). We thus 
conclude that, given the conditions adopted in our simulations, the characteristic signature of 
double lensing should be within reach of current observations. 

Finally, in order to test the reliability of the detection, we have performed analogous 
simulations using a single cluster. In this case, as explained above, we expect a (nearly) vanishing 
V A u, especially after the correction for the spread of background sources is applied (see 
Sect. 5.3.2). Figure [15] shows a map of this quantity (with the correction applied) in the case 
of a single lens at redshift 0.1 (the parameters of this lens are the same as those of the first lens 
considered in the double configuration). Note that the noise observed is well below the signal 
shown in Figure [T3]. 



7. Conclusions 

In this paper we have generalized the statistical analysis at the basis of mass reconstructions 
of weak gravitational lenses from the standard case of single lenses to the case of double lenses. 
From one point of view, this study leads to practical tools to deal with "errors" that might occur 
when the standard single lens analyses are applied to the not infrequent cases where two massive 
clusters happen to be partially aligned along the line of sight. In reality, our study of double lenses, 
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based on the analytical framework mentioned in the first item below then checked and extended 
by means of simulations, has opened the way to a number of interesting results. This is a list of 
the main points made in our paper: 

1 . A consistent analytical framework has been constructed where the contribution of the small 
asymmetry in the Jacobian matrix induced by double lensing is retained to two orders in the 
weak lensing asymptotic expansion. 

2. Given the known properties of the distribution of (bright) clusters of galaxies, we have 
shown that a few configurations are likely to be present in the sky, for which the small 
effects characteristic of double lensing may already be within detection limits. 

3. As a separate astrophysical application, we have demonstrated that, if the characteristic 
signature of double lensing appears in an observed configuration where a single bright 
cluster exists, with properties well constrained by independent diagnostics, the location and 
the mass distribution of the dark cluster, if present and responsible for the effect, can in 
principle be reconstructed unambiguously. 

4. We have checked that the redshift distribution of the source galaxies should not confuse the 
signature of double lensing. 

5. An examination of the relevant contributions to the noise of the shear measurements has 
brought up a limitation of weak lensing analyses related to the clustering of source galaxies. 
In particular, weak lensing studies of single clusters are found to be characterized by a 
lower limit for the expected noise, regardless of the depth of the images and of the density 
of source galaxies. A firm quantitative estimate of this surprising effect will be provided an 
a separate article. 



This work was partially supported by MURST of Italy. We thank Peter Schneider for several 
stimulating discussions that have helped us improve this paper significantly. 



A. Second order weak lensing analysis 

In this Appendix we derive Eqs. (|T[) and (J32|). 



-30- 



We start from the definition of u, which, to second order, is 



u 



c,i cr,2 
C,2 -CT.l 



a + 71 
72 2 

'71 



72 

a -71 



7i,i + 72,2 
72,1 - 7l,2 



72 



(Al) 



We recall that cr = 1 + 0(e). By inserting Eqs. (|19|-pl|) here, rather long expressions are found: 

1 



Ml= 4 



02,12 (#2,2 - + (#1,2 + ^2,l) V A ^1 

1 



«2 



4L 



4V 2 6 I 2 - 2 o 2 V ' s ~~ 2 ^2,ii^i,i _ 



01,12 (02,2 - ^1,1) - (^1,2 + 02,l) V A ; 



(A2) 



(A3) 



From these expressions we can calculate the divergence and the curl of u. The expressions 
obtained by a straightforward application of the definitions contain also terms of order e 3 or 
higher, which are to be discarded. 

Let us start from the curl of u. From the previous equations, after some manipulations, we 
obtain 



1 



V Aw=V 2 (V A s 

01,11 v a s , + 05 i22 v a e% + 

ns ns ns ns 

!7 1,12 (7 1,22 C7 2,ll f7 2,12 



0i,i va0 :h+02,2 va0 :22+ 



(A4) 



The terms in this expression are organized by rows. Recalling that V A 6 s = 2r ~ e 2 , we 
recognize that the terms in the second and third lines are of order e 3 , and thus should be dropped. 
A simple analysis also shows that the two terms in the last line cancel out. Finally, as to the first 
line, we note that 6\ x and 6\ 2 are of the form 1 + 0(e). Thus at the end we obtain 



VAu~ ^V 2 (VA0 s 



(A5) 



that is Eq. 



Let us now turn to the divergence of u. From Eqs. ( |A2| ) and (|A3D, a rather long calculation 
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(A6) 



gives 

V-£=V 2 (V-0 s ) + 

- V • (O l l2 2 ~ 01,111 + ^2,112 — ^2,222) 

4 #1, 1 #2,112 _ 4^2,2^1,122 ~ 
V 2 0i(0i,n - 6*1,22 + 2 02,12) + 
V^O^ll _ ^2,22 ~~ 26 *1,12) + 

fc+y(VA0; u -VAr 22 )" 

The first line of this expression needs no special explanations, since we recall that V • s = 2a. 
The sum of the terms in the second and third lines can be shown to be equal (dropping terms 
of order e 3 or higher) to — 4crV 2 <r. By replacing 2# 2 12 by 26\ 22 and 26\ 12 by 26> 2 n (which is 
allowed to second order), the terms in parentheses in the fourth and fifth lines become respectively 
V 2 9\ and V 2 9 s 2 . Finally, the last line can be discarded, because it contains two terms of order e 3 . 
In conclusion, we obtain 

V u~ 2VV-<tV 2 ct- i||V 2 s || 2 . (A7) 

411 11 

This expression can be replaced by V 2 (ln a). In fact, to second order we have 

V 2 (W) = (2 -a)V 2 o - ||V(t|| 2 . (A8) 

We now apply the vector identity V(V • A) = V 2 A — V A V A A (valid for any vector field A) 
to the last term of Eq. (gg): 

||V(V ■ S )|| 2 =||V 2 S || 2 - 2(V 2 S ) • (V A V A s ) + 

2 , , l|2 (A9) 
|| V A VA0 s || = ||V 2 S || , 

where the last equality holds because V A s = 2r is of order e 2 . Thus finally we can rewrite 
Eq. (J5§) as 

V 2 (lna) = (2 - a)V 2 a - i||V 2 s || 2 . (A10) 
Comparing this expression with Eq. ( |A7| ) we obtain the desired relation (|3"T|). 
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Fig. 1 . — The geometrical configuration of a double lens system: the observer O, the two deflector 
planes V x and T> 2 , and the source plane S. 




Fig. 2. — This plot shows the light distribution, in arbitrary units, that would be observed if the two 
clusters had equal and constant mass-to-light ratio. Note that the peak corresponding to the center 
of the far cluster is slightly higher, because of the contribution by the near cluster, which appears 
to be more diffuse. 



This preprint was prepared with the AAS IATgX macros v4.0. 
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Fig. 3. — The sum of the dimensionless mass distributions k^\6) and k^ 2 \6) for the strong 
case. Note that, although the two clusters are physically identical, the near cluster appears 
to be significantly weaker and more diffuse. The maximum value reached by the combined 
dimensionless mass distribution is + ~ 1.072. However, the lens is still sub-critical 
(the maximum value for 1 — a is about 0.928). 



-o.i 




Fig. 4. — The quantity 1 — a — (k^ + k^) plotted here for the strong case shows a "hole" and 
a "peak" which can be understood in terms of the lens effect of the near cluster on the "effective" 
mass of the second cluster. The quantity shown here vanishes only in the weak lensing limit. 
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Fig. 5. — The difference V ■ u — V 2 In a on the 0-plane for the weak case. As described in the 
text, to second order this quantity is expected to vanish (Eq. (plj)). For comparison, we note that 
the maximum value of I V • u\ is about 0.00647. 




Fig. 6. — The difference V A u — V 2 r on the 0-plane for the weak case. As described in the text, 
to second order this quantity is expected to vanish (Eq. (|3"2|)). The maximum value of | V A u\ is 
about 3.37 x 10~ 5 . 
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Fig. 7. — The difference V A u — V • (Vr/cr) on the 0-plane for the weak case. To second order, 
V • (Vr/cr) = V 2 r, and in fact the plot shown here is very similar to that of Fig. |6|. 

-o.ooi ■ 




Fig. 8. — As in Fig. |5], but for the strong case. Note that the difference shown remains small. For 
comparison, the maximum value of | V • u\ is about 0.331. 

-o.ooi ■ 
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Fig. 9. — As in Fig. but for the strong case. The maximum value of | V A u\ is about 0.0167. 
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Fig. 10. — As in Fig. [7], but for the strong case. 
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Fig. 11. — The detection of a dark cluster at redshift z = 0.1. The value of the integral S (see 
text for definition) is shown as a function of the redshift z of the dark cluster. Note that S does 
not exactly vanish at z = 0.1 because of discretization errors. The luminous cluster is at redshift 
z lum = 0.4. 
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Fig. 12. — As for Fig. [TT|, but with the luminous and the dark cluster locations swapped. The 
function S has been computed on sampled redshifts, and for this reason the minimum at z = 0.1 
appears to be somewhat jagged. 
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Fig. 13. — Variance of w as a function of the lens redshift z d (O = 0.3, = 0.7). The assumed 
source probability distribution is given by Eq. ([74|). 
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Fig. 14. — Density plot of V A u for the "strong" double lens. This plot has been obtained 
by simulating the observation of galaxies in a field of 10' x 10'. The adopted galaxy density is 
100 gal arcmin -2 . The superimposed contour plot describes the quantity V A u , after suitable 
smoothing. Contours are at levels [±3, ±9, ±15, ±21, ±27] x 10~ 5 (positive contours are solid, 
negative ones are dashed). Because of a different choice for the unit of length, the contour levels 
of V A u shown here should not be compared directly with those plotted in Fig. |9|. The plane 
represented in the figure is precisely the (8i, 8 2 ) plane as drawn in Fig. ^|. 
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Fig. 15. — Same as Fig. [T^ but for a single lens at redshift z = 0.1. Equal gray levels in the two 
pictures correspond to equal values for V A u. A comparison of the signal in the previous figure 
with the noise observed here provides an estimate of the detection significance of a double lens. 
Note that most of the noise in this map is due to the finite ellipticity of source galaxies and, to a 
smaller extent, to the spread in redshift of galaxies. 



